Search CORE

8 research outputs found

Disfluency detection using a noisy channel model and deep neural language model

Author: Jamshid Lou Paria
Publication venue: Sydney, Australia : Macquarie University
Publication date
Field of study

Theoretical thesis.Bibliography: pages 43-46.1. Introduction -- 2. Literature review -- 3. LSTM noisy channel model -- 4. Experiments and results -- 5. Summary and conclusions.Although speech recognition technology has improved considerably in recent years, current systems still output simply a sequence of words without any useful information about the location of disfluencies. On the other hand, such information is necessary for improving the readability of speech transcripts. In fact, speech transcripts containing a lot of disfluencies are difficult to understand, so removing disfluent words can make speech transcripts more readable. Moreover, many tasks including dialogue systems input spontaneous speech. Such systems are usually trained on fluent, clean corpora, so inputting disfluent data would decrease their performance. This thesis aims at introducing a model for automatic disfluency detection in spontaneous speech transcripts called LSTM Noisy Channel Model. The model uses a Noisy Channel Model (NCM) to find "rough copies" that are likely to indicate disfluencies and generate n-best candidate disfluency analyses. Then, the underlying fluent sentences of each candidate analysis are scored using a Long Short-Term Memory (LSTM) language model. The LSTM language model scores, along with other features, are used in a reranker to identify the most plausible analysis. We show that using LSTM language model scores as features to rerank the analyses generated by an NCM improves the state of-the-art in disfluency detection.1 online resource (ix, 46 pages

Macquarie University ResearchOnline

ShEMO: a large-scale validated database for Persian speech emotion detection

Author: A Batliner
A James
C Busso
C Lee
F Eyben
GC Cawley
GW Furnas
J Cohen
J Landis
JA Russell
K Scherer
K Scherer
L Breiman
M Ayadi
M Frank
M Grimm
M Hamidi
M Nicolaou
M Wöllmer
Mansoureh Karami
N Alvarado
N Keshtiari
Omid Mohamad Nezami
P Ekman
PA Lewis
Paria Jamshid Lou
R Cowie
T Johnstone
Z Esmaileyan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref